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(54) Title: PROTEIN COMPLEMENTATION IN TRANSGENIC PLANTS 



(57) Abstract 

The invention relates to pairs of parent plants 
for producing hybrid seeds and to methods for pro- 
ducing plants with a desired phenotype. The desired 
phenotype is an active enzyme, a regulatory protein or 
a protein which affects the functionality and/or viabil- 
ity and/or structural integrity of a cell. Preferably, the 
desiredphenotype is substantially absent from the par- 
ent plants/lines. In particular, the invention relates to 
parent plants and methods involving plant lines for pro- 
ducing male-sterile plants and seeds. 
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PROTEIN COMPLEMENTATION IN TRANSGENIC PLANTS 

This invention relates to pairs of parent plants for 
producing hybrid seeds and to methods for producing 
5 plants with a desired phenotype. The desired phenotype 
is an active enzyme, a regulatory protein or a protein 
which affects the functionality and/or viability and/or 
structural integrity of a cell. Preferably, the desired 
phenotype is substantially absent from the parent 
10 plants/lines. In particular, the invention relates to 
parent plants and methods involving plant lines for 
producing male-sterile plants and seeds. 

The present invention describes a protein 
15 complementation system, with a variety of different 
applications. The system can be explained and 

exemplified with reference to obtaining male-sterile 
plants and embryoless seeds although it is not limited 
to these applications. 

20 

The use of dominant Artificial Male Sterility (AMS) in 
plants is described in WO95/20668. This document 
describes a binary system using two genes which together 
(but not in isolation) cause male sterility. The genes 
25 are brought together by crossing plants, each parent 
being homozygous for the gene, which generates a 
homogenous population of male sterile plants. 
WO95/20668 describes several ways to implement the gene 
binary system, including the following: 

30 

i. a system based on activation of transcription: a 
transcriptionally inactive AMS gene is activated 
upon crossing by provision of the relevant 
transcription factor; 
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ii. a system based on activation of splicing: an AMS 
gene inactivated by the presence of an intron is 
activated upon crossing by provision of the 
5. relevant maturase; 



iii. a system based on the suppression of a stop codon 
during translation: an AMS gene inactivated by 
introducing an artificial stop codon in the ORF, is 
10 activated upon crossing by provision . of an 

artificial stop suppressor tRNA for the introduced 
stop codon. 

a system based on sequence-specific gene 
inactivation: One parent contains a modified male 
fertility gene and a transgene which inactivates 
only the unmodified male fertility gene. The other 
parent contains a transgene which inactivates only 
the modified male fertility gene. In the hybrid 
both the modified and unmodified male fertility 
genes are inactivated causing male sterility. 

a system based on preventing restoration of male 
fertility by a restorer gene: the first parent 
contains the AMS gene and the restorer gene, and 
the second parent contains a gene inhibiting the 
action of the restorer gene. 

However, the binary systems described above have so far 
3 0 proved complex to implement and have encountered a 
variety of difficulties. 



IV. 



15 



20 



v. 

25 



For example, it has been found that the use of a 
suppressor tRNA (described in Betzner et al. 1996, 
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Abstract of the 14th International Congress of Plant 
Reproduction, Lome, Australia) can have deleterious 
consequences for some plant species. While this dees 
not preclude its use, it does make the screening of 
5 suitable transgenic plants more labour intensive than 
desirable. Another example is the leakiness of the T7 
promoter (described in EP-A-0589841) . Some plants 
transformed with a T7 promoter driving Barnase were 
sterile in the absence of the T7 RNA polymerase. Again, 
10 this does not preclude use of the system but it does 
make it difficult to identify suitable transgenic 
plants. Furthermore, in certain plants the gene binary 
system is sub-optimal since not all of the required 
genetic elements are fully characterised. 

15 

Two areas of prior art have been explored which have 
resulted in a phenotype conferred to a plant by the 
combination of two proteins. 

20 In 1989, Hiatt and coworkers (Nature, vol. 342, p. 76- 
78) described the production of a functional antibody in 
tobacco by crossing tobacco plants expressing a gamma 
immunoglobulin gene and a kappa immunoglobulin gene. 

25 Problems were, however, encountered with this system. 
Since the light and heavy chains of an antibody interact 
through disulfide bridges, the bridges were unable to 
form in the reducing environment present in the 
cytoplasm. Assembly of a functional antibody in plants 

30 thus requires that both chains are targeted to the 
endoplasmic reticulum then secreted to the apoplast (the 
space between cells) . The production of antibodies in 
plants has thus been limited to the production of 
secreted antibodies or the production of single chain 
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antibodies. 

In 1992 Lloyd et al . (Science, vol. 258, p. 1773-1775) 
described the transfer in Arabidopsis and tobacco of two 
5 maize genes coding for the transcription factors R and 
CI. Ectopic expression of these genes separately in 
heterologous plants has some effect on the transcription 
of endogenous genes. In particular the genes have some 
effect in isolation, and this may preclude their use for 
10 applied purposes. Co-expression of the two genes had 
more dramatic qualitative and quantitative effects, than 
expression of either gene alone. However, these genes 
have properties severely limiting their usefulness and 
their general inapplication is described in the paper. 

15 

It has been shown that the Arabidopsis transcription 
factors Apetala3 and Pistillata can be ectopically co- 
expressed, and jointly in concert cause a new phenotype 
in the Arabidopsis flower (Krizek and Meyerowitz, 1996, 
20 Development, vol. 122, p. 11-22). The limitations 
described above for the R/Cl proteins also apply in this 
case. 

The present invention describes a protein 
25 complementation system which overcomes many of the 
problems and difficulties associated with known gene 
binary systems. The protein complementation system 
according to this invention is based on the expression 
of two or more gene sequences in a single plant, which 
30 polypept ides/proteins, associate, interact or come 
together to form an active enzyme, a regulatory protein 
or a protein which affects the functionality and/or 
viability and/or the structural integrity of a cell. 
Hereinafter, in this text all references to a protein 
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which affects the structural integrity of a cell also 
describes a protein which may, in addition, or 
alternatively, affect the functionality and/or viability 
of a cell. Some polypept ides/proteins may fall in more 
5 than one of these categories None of the individual 
gene sequences present in a given plant lead to a 
significant phenotypic effect in these plants. 

The present invention describes the creation of a plant 
10 which has a desired phenotype through expression of an 
active enzyme, regulatory protein or protein which 
affects the structural integrity of a cell (eg. a 
membrane destabilizing protein) . The plant may be 
obtained by crossing a pair of parent plants a and b. 
15 Plant a contains one or more gene sequences which encode 
a polypeptide (s) or protein(s) (A) with little or no 
activity so that the desired phenotype is not 
significantly (or substantially) caused by expression of 
the one or more genes in plant a alone. Plant b also 

2 0 contains one or more gene sequences which encode a 

polypeptide (s) or protein(s) (B) also, with little or no 
activity so that the desired phenotype is not 
significantly (or substantially) caused by expression of 
the one or more genes in plant b alone. When plants a 
25 and b are crossed, the resulting hybrid expresses both 
polypeptides and/or proteins A and B. These two 
polypept ides/proteins associate, interact or come 
together to form an active enzyme, regulatory protein or 
protein which affects the structural integrity of the 

3 0 cell, with the result that the daughter plant displays 

the desired phenotype. NB: From hereon, when discussing 
the polypeptides/proteins A or B they will be referred 
to only as 'polypeptides 1 for the sake of convenience. 
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This protein complementation binary system is simpler 
than the previously described binary systems since there 
is no need for interaction between genes, no required 
modification of the expression of genes and no 
5 modification of the level of expressed polypeptides in 
the daughter plant compared to the parent plants. 

The present invention is described with reference to the 
Figures which are: 

10 

FIGURE 1A; Barnase coding sequence; 
FIGURE IB; Intergenic sequence; 
FIGURE 1C; Barstar coding sequence; 

FIGURE ID; Translational fusion of ORF Peptide A**/ (Gly4 
15 ser)3 Linker peptide / GUS; 

FIGURE IE; Nucleotide sequence of Translational fusion 
of Ubiquitin genomic sequence and ORF Peptide A***; 
FIGURE IF; Nucleotide sequence of Translational fusion 
of Ubiquitin genomic sequence and ORF peptide B*** 

2 0 FIGURE 1G; DNA sequence of IPCR (inverse polymerase 

chain reaction) primers (example 1) 
FIGURE 2; Schematic illustration of pepA* 
and pepB* construction by Inverse PCR (IPCR) 
FIGURE 3A; In vitro construction from synthetic 
25 oligonucleotides of S-peptide, S (+5) -protein and S- 
protein; 

FIGURE 3B; In vitro construction from synthetic 
oligonucleotides of the sequence encoding the S -peptide 
and the (Gly4-Ser)3 linker; 

3 0 FIGURE 4A; protein and DNA sequences of S -peptide and S- 

peptide with (Gly4-Ser) 3 linker; 
FIGURE 4B; protein and DNA sequences of S (+5) -protein 
and S-protein. 

FIGURE 4C(i); PCR amplification product encoding partial 
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A0X3 targeting signal; 

<ii) ; ORF encoding A0X3 targeting sequence 
(underlined) and S-peptide 

(iii) ; ORF encoding AOX3 targeting sequence 
5 (underlined) and S-peptide/ (Gly4 Ser)3/GUS 

(iv) ; ORF encoding A0X3 targeting sequence 
(underlined) and S -protein 

(v) ; translational fusion of Ubiquitin genomic 
sequence and ORF of S -protein; 

10 FIGURE 4D; nucleotide sequence of IPCR primers (example 
3) 

FIGURE 5; production scheme for embryoless maize grains. 

Embroyoless seeds harvested from female rows only = 100% 
15 of embryoless maize seeds 

or 

Seeds harvested from all the field plants = 
20 approximately 80% of embryoless maize seeds: 

note that if this sort of seeds harvesting is suited a 
random sowing with 10% of male plants and 90% of female 
plants is desirable and possible. 

25 

Legend 

male parent A 
expressing pepA* in embryos 
30 Genotype: emb-pepA*/emb-pepA* 

or 

emb-pepA* linked to Herbicide 
resistance/emb-pepA* linked to 
herbicide resistance 
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female parent B 
expressing pepB* in embryos only- 
Genotype: emb-pep*/emb-pepB* in a 
5 male sterile cytoplasmic environment 

or 

emb - pepB * / emb - pepB * 
Artificial Male Sterility linked to 
Herbicide Resistance/+ 

10 

According to a first aspect of the invention there is 
provided a pair of parent plants for producing seeds 
comprising: 

15 (i) a first parent plant containing one or more 

gene sequences encoding a polypeptide A; and 
(ii) a second parent plant containing one or more 
gene sequences encoding a polypeptide B; 

20 wherein the polypeptides A, B, when expressed separately 
in different plants, do not form an active enzyme a 
regulatory protein or other protein which affects the 
structural integrity of the cell but when expressed in 
the same plant do form an active enzyme, regulatory 

25 protein or other protein which affects the structural 
integrity of the cell. Presence of the active enzyme, 
regulatory protein or protein which affects the 
structural integrity of the cell in a single plant, is 
the desired phenotype. 

30 

The present invention includes the scenario of inter- 
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extra-genic repression/complementation/ suppression; that 
is, where a mutation in one subunit of a multi-subunit 
complex can complement a mutation in another sub-unit in 
order to restore the active enzyme, regulatory protein 
5 or protein affecting the structural integrity of the 
cell. In such a scenario, the polypeptide (s) /protein(s) 
A and B may be the same in the two parent plants, with 
the exception of the different mutations. Examples 
include the E.coli regulatory proteins as described by 
10 Tokishita S.I., and Mizuno T. , 1994, Mol. Microbiol. 
(UK), 13/3, 435-444 and the GroES and GroEL proteins of 
E.coli as described by Zeilstra-Ryalls J., et al., 1994, 
J*. Bacterid. (US), 176, (21), 6558-65. 

15 In the present invention, the pair of parent plants can 
be described as a pair of complementary plants for 
producing hybrid seeds or even a pair of complementary 
transgenic plants for producing transgenic hybrid seeds. 

20 It is most likely that at least one of the pair of 
parent plants is transgenic. When used herein the term 
'transgenic 1 refers not only to genetic material from 
another species but to genetically manipulated DNA from 
the same plant or species. The genetic manipulation of 

25 the plant may be by a microbiological process such as 
AgroJbacteritun tumefaciens (Horsch R.B., Fry J.E., 
Hoffman N.L., Eichholtz D., Rogers S.G., Fraley R.T., 
(1985), Science, 227 : 1229-1231)). Alternative 
manipulations include biolistic transformation, a 

3 0 technique also well known in the art, the use of 
Agrobacterium rhizogenes, particle gun, electroporation 
polyethylene glycol or silica fibers. 

The present invention may be applied to any plant, in 
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particular, maize, wheat, tomato, oilseed rape, barley, 
sunflower, linseed, peas, beans, melon, pepper, squash, 
cucumber and egg plant (aubergine) and other broad acre 
plants'. 

5 

Use of the term "one or more gene sequences encoding a 
polypeptide...." refers to any number of stretches of 
genetic material (preferably DNA) which can encode one 
or more peptides/polypeptides/proteins . Thus 

10 "polypeptides" A or B can actually comprise more than 
one amino acid sequence which may or may not be linked 
or associated. There is no restriction on the location 
in the parent plant genome of the one or more gene 
sequences. Where more than one gene sequence is 

15 present, encoding for more than one 

peptide/polypeptide/protein, the relationship between 
the encoded sequences in each parent plant is only 
relevant to the extent that the parent plant does not 
display the desired phenotype (to any significant 

2 0 level) . When the one or more gene sequences encoding a 
polypeptide A are expressed in the same plant as the one 
or more gene sequences encoding polypeptide B, then the 
result, according to the invention is the phenotype of 
an active enzyme, a regulatory protein or a protein 

2 5 which affects the structural integrity of a cell. 

Proteins which affect the structural integrity of a cell 
include proteins that destabilise or create holes or ion 
channels in cellular membranes. 

3 0 A particular application of the present invention is the 

production of male-sterile plants. Accordingly, the 
polypeptides A, B when expressed in the same plant may 
cause male-sterility by ablation of the tapetum. An 
alternative application, also of the first aspect of the 
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invention is the expression of polypeptides A, B in the 
same plant which form an active enzyme, a regulatory 
protein or protein which affects the structural 
integrity of a cell, which, through cell ablation in a 
5 specific tissue results in a different phenotype, as 
described below. 

In addition to causing male-sterility, potent hydrolases 
like Barnase can be used for other applications where 

10 cell ablation is needed, for example to remove an 
unneeded organ from a hybrid crop. This may contribute 
to reducing downstream processing costs. One example is 
the production of embryoless seeds, which is now 
described as follows: In the production of flour (from 

15 wheat) or semolinas (from maize or wheat) or corn flakes 
(from maize) or for other uses, it would be desirable to 
have seeds with no embryo. The use of embryo specific 
promoters in the first aspect of the invention above 
would enable ablation of embryos in seeds, in a cross 

20 dependent manner. That is, in the seeds produced by the 
plant containing one or more gene sequences encoding 
polypeptide A, pollinated with pollen from a plant 
containing one or more gene sequences encoding 
polypeptide B. Self pollination of plant a has to be 

25 prevented, for example by making plant a male-sterile. 
A possible production scheme for valuable embryoless 
maize grains would be the following: generate a plant 
containing one or more gene sequences encoding 
polypeptide A (plant a) and a plant containing one or 

30 more gene sequences encoding polypeptide B (plant b) , 
designed so that combination of polypeptide A and 
polypeptide B in one seed results in embryo ablation. 
Figure 5 shows a production scheme for embryoless maize 
grains according to the invention. 
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The biochemical composition of plants can also be 
manipulated according to the first aspect of the 
invention, for example by fatty acid biosynthetic 
5 enzymes. Where the presence of an unusual but valuable 
fatty acid in the plant has a deleterious effect on the 
plant, it would be useful to be able to produce seeds 
with the unusual (fatty acid) oil through a cross 
between two lines having a normal (or quasi normal) oil 

10 composition (to the extent that each parent line is not 
deleteriously effected) . Splitting the enzyme 

responsible for the valuable fatty acid biosynthesis in 
two or more inactive parts, provides a practical way of 
producing the seeds with the valuable oil. Where the 

15 enzyme responsible for the desired trait is 
heteromultimeric, separating the genes from the. various 
monomers in the two parent plants is a simple way to 
implement the invention. More generally, this invention 
can be used to obtain hybrid seeds or hybrid plants with 

20 a particular phenotype which neither parent has. In 
particular, this invention can be used to create hybrid 
plants, resistant to a herbicide, via the crossing of 
two parent plants. Each of the parent plants expresses 
one or more non- functional parts of an active enzyme, 

25 regulatory protein or protein which effects the 
structural integrity of a cell, which is directly or 
indirectly responsible for herbicide resistance. As the 
one or more genes in each parent plant responsible for 
the trait will segregate independently, this will result 

30 in the gametes of such hybrid plants (especially pollen 
grains) giving rise to a lower transfer of the herbicide 
resistance trait to relatives or to weeds (in comparison 
with a classical single gene) . If the hybrid seed is 
the harvested desirable product, expression of the 
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desired trait would be restricted to the seed endosperm 
and embryo since these tissues are genetically hybrids. 

The active enzyme, regulatory protein or protein which 
5 affects the structural integrity of a cell is preferably 
localised to a tissue specific (ie. present only in a 
selected tissue) . This requires that one or both of the 
gene sequences encoding the polypeptides A, B are 
operatively linked to an appropriately stimulated 

10 promoter, eg. a tissue specific promoter so as to 
produce the desired phenotype. Where only one of the 
polypeptides is limited to expression in a selected 
tissue, the other polypeptide requires constitutive 
expression or at least an expression pattern which 

15 overlaps with that of the first polypeptide. 

As described above, the expression may be seed or embryo 
specific and promoters for such tissue specificity are 
well known in the art. In the case of male-sterility, 

20 the promoter is preferably tapetum specific. Such 
promoters known in the art include the TA29 promoter 
(EP-A-0344029) , the A9 promoter (Paul et al 1992, Plant 
Molecular Biology, vol. 19, p. 611-622) and the 
promoters described in W095/29247. In order for 

25 heterozygous plants to have the desired phenotype, 
promoters must be active at the sporophytic level. 

The choice of gene sequence for producing an active 
enzyme, regulatory protein or protein which affects the 
30 structural integrity of a cell depends, of course, on 
the desired phenotype. Any gene sequence encoding an 
active enzyme, regulatory protein or protein which 
affects the structural integrity of a cell can be used 
provided that the protein activity can result from the 
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association , interaction or combination of two or more 
polypeptides encoded by two or more gene sequences and 
that their activity can result in the desired phenotype. 
Immediately obvious proteins which can be suitable are 
5 those which are naturally encoded by two or more 

polypeptides and which self -assemble to form the final 
protein structure. The individual polypeptide units 
(subunits) should have no significant activity in vivo. 

10 Suitable proteins for use according to the invention 
include natural heterodimeric proteins such as the Cl-R 
maize proteins and the Apetala3-Pistillata (Ap3-Pi) 
Arabidopsis thaliana proteins. When present in the 
tapetum, the dimer protein Ap3-Pi can activate genes 

15 responsive to this transcription factor (which would 
normally be inactive because this transcription factor 
is normally absent from, or present at a low level in, 
the tapetum) . The activated gene is preferably, but not 
necessarily, endogenous to the plant of interest. For 

2 0 example, expressing the dimer Ap3-Pi .in the tapetum of 
maize will activate transcription of genes normally 
involved in flower development in other floral organs, 
and will prevent normal pollen maturation. The level of 
sterility of such a system can be improved by also 

2 5 engineering into the daughter plant a gene sequence 

which is affected by the produced active enzyme or 
regulatory protein. 

One example is the introduction into one of the parent 

3 0 lines of a gene sequence from Barnase or PR-Glucanase 

under the control of the Apetala3 promoter (pApetala3) . 
The Apetala3 promoter is responsive to the Ap3-Pi dimer 
and thus expression of the Barnase or PR-Glucanase 
protein occurs in the daughter plant. Such a system 
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provides for the enhancement of plant male -sterility 
with the additional advantage of being under a strict 
control mechanism (via the pApetala3) . Thus, the cause 
of the desired phenotype may be direct, ie, a direct 
5 result of the active enzyme, regulation protein or 
protein which affects the structural integrity of a 
cell, or may be indirect, ie. acting via an intermediate 
factor. Other transcription factors, for use in the 
invention, exist already as, or can be engineered to, a 
10 heterodimeric form, for example using the dimerisation 
domains described below. These include artificial 
transcription factors made by the association of a DNA 
binding domain and an activation domain of different 
origins . 

15 

An alternative use of the Apetala3 -Pistillata system, is 
the complementation of mutations in sub-units of the 
proteins. For example, one parent plant may express 
both proteins but with a mutation in one or the other so 

20 that the plant does not have the active dimer. The 
other parent plant may also express both proteins, in 
this case, a mutation being in the other protein. The 
second parent plant would not express the active dimer. 
A cross between the two parent plants would result in 

25 expression of genes to produce an active dimer. 

Ectopic expression of the subunits for these 
transcription factors can be used to modulate expression 
of their target gene and cause male sterility or other 
30 traits (including pleiotropic effects) in a cross- 
dependent manner. 

It is also possible to use, according to the first 
aspect of the invention proteins which have to be 
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"artificially"' split into two or more nucleic acid 
coding sequences. The resulting polypeptides/proteins 
must associate, assemble, interact or come together 
when expressed in the same plant to form an active 
5 enzyme, regulatory protein or protein which affects the 
structural integrity of a cell. Such artificial 
splitting of enzymes and proteins is today easily 
achieved by predicting where the protein can be split 
into two or more domains, for example predicting by 

10 structural biochemistry such as X-ray crystallography, 
functional protein analysis in mutants, structure 
prediction from sequence analysis or by limited 
proteolysis, amongst other techniques. In this way, the 
random coil or other suitable regions are identified as 

15 places where the protein may be split. 

Examples of artificially split proteins include: 

Barnase: This protein has been widely used to cause cell 
20 ablation, when expressed in specific tissues. Under the 
control of a tapetum specific promoter, expression of a 
Barnase gene causes male-sterility in many plant species 
(EP-A-0344029) . It is known that the Barnase protein 
can be split into two polypeptides, which per se have no 
25 catalytic activity [in vitro] . When put together the 
two polypeptides can self -assemble to produce an enzyme 
whose product has RNase activity. (Sancho and Fersht, 
1992, J.Mol.Biol., 224, 741-747). 

3 0 RNase A can also be used. It was shown, as long ago as 
1959 (Richards and Vithayathil, J.Biol .Chem. , 234, 1459- 
1465) that RNase A can similarly be split by mild 
proteolytic treatment into two polypeptides which can 
then reassociate and produce an active enzyme. 
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In order to implement a system, according to the present 
invention, involving artificially split proteins, it may 
be necessary to design genetic constructs in order to 
5' express the polypeptides therefrom. In order to design 
the genetic constructs whose products will associate to 
form the active enzyme some modifications may be 
required. For example, a methionine codon can be added 
in front of the ORF encoding the second half of the 

10 active enzyme and a stop codon can be added after the 
ORF encoding the first half of the active enzyme. If 
the polypeptides are expressed as the C terminal part of 
a translational fusion to another protein or to a 
protein targeting sequence, then a start codon may be 

15 absent from the ORF of polypeptide A and/or polypeptide 
B, whereas a stop codon is still needed to terminate the 
ORF of polypeptide A and polypeptide B, respectively. 
If polypeptide A or B is expressed as the N- terminal 
part of a translational fusion to another protein, then 

20 the • ORF of polypeptide A or B will start with a 
methionine codon whereas the termination codon is 
provided by the ORF of the other protein to which it is 
fused. Such genetic construct design is commonplace 
and well known to the person skilled in the art. 

25 

The invention may also be practised by expressing two 
portions of two different enzymes that together give a 
different activity than either of the intact parent 
proteins. 

30 

Preferably, both parent plants are homozygous with 
respect to the gene sequences encoding polypeptide A or 
polypeptide B. Such genotypes ensure that all offspring 
will express the active enzyme, regulatory protein or 
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protein which affects the structural integrity of the 
cell . 

If one' or more of the polypeptides (A or B) is/are small 
5 and there are doubts that any of them will be stable in 
a cell, it is possible to use well -known systems wherein 
the small polypeptide is fused in frame to a "carrier 
protein" which protects it from being degraded or 
increases its proteolytic stability, but retains its 
10 freedom to interact with the other polypeptide (s) to 
form the active enzyme, regulatory protein or protein 
which affects the structural integrity of a cell. 

The carrier protein can be chosen so that the 
15 polypeptides A or B are not affected by the fusion. One 

suitable carrier protein is the [^-Glucuronidase (GUS) 
protein, which tolerates addition to its NH 2 end, and is 
a good reporter gene in plants. In this case, one can 
use the level of GUS activity to evaluate the expression 

20 level of the fused small polypeptide. This, can be 
useful for screening primary transf ormants . Another 
suitable carrier protein is ubiquitin (Hondred and 
Vierstra, 1992, Curr. Opin. Biotechnol . 3, 147-151; 
Vierstra, 1996, Pant Mol . Biol., 32, 275-302). When 

2 5 fused in frame to the carboxy- terminus of ubiquitin, 
proteins accumulate significantly in the plant 
cytoplasm. In addition artificial ubiquitin protein 
fusions resemble natural ubiquitin extension proteins, 
e.g. UBQ1 of Arabidopsis thaliana (Callis et al . , 1990, 

30 J. Biol. Chem., 265, 12486-12493), in that they are 
cleaved precisely at the C-terminus of ubiquitin (after 
Gly 76 by specific endogenous proteases. This process 
releases the " attached" protein or peptide moiety from 
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the fusion protein and thus permits polypeptide A and B 
to assemble into a functional enzyme or protein. Also, 
for the purpose of protecting small proteins from 
cytoplasmic proteolysis, translational products may be 
5 enlarged by fusing them to protein targeting signals, 
e.g. the C-terminus (Whelan and Glaser, 1997, Plant Mol . 
Biol. 33, 771-781) and be directed to specific locations 
in the cell such as to mitochondria. A suitable signal, 
for example, is the one found in the A0X3 protein of 

10 soybean (Finnegan and Day, Plant Physiol., 1997, 114, pp 
155) which would add 50 amino acids to polypeptide A and 
B, respectively. Import associated proteolytic 
processing will remove the targeting signal by cleavage 
after Met50 thereby releasing the free polypeptides A 

15 and B into the mitochondria where they combine to 
disrupt mitochondrial function and thus to compromise 
cell viability. 

In some cases, when expressed in two or more portions, 

2 0 the polypeptides may not spontaneously associate, 

assemble, interact or come together in vivo to reform an 
active protein, or regulatory enzyme or protein which 
affects the structural integrity of a cell . In other 
cases the association of the polypeptides may be weak so 
25 that little active reconstituted protein is formed. To 
circumvent these problems, each protein portion may be 
linked to a protein dimerisation domain, thus enabling 
the portions to be brought together in vivo. Such 
protein dimerisation domains are found in many proteins 

3 0 that naturally form dimers or multimers and the linking 

technique is well known in the art. 

For example, the human cysteine-rich protein LIM double 
zinc finger motif has been fused to the Gal4 and VP16 



WO 98/37211 



- 19 - 



PCT/GB98/00542 



proteins. In. contrast to the unmodified Gal4 and Vpl6 
proteins the LIM-Gal4 and LIM-VP16 associate in vitro 
and in vivo (in NIH 3T3 mammalian cells) forming an 
active transcription factor (Feuerstein et al., 1994, 
5 Proc. Natl. Acad. Sci. U.S.A. 91, 10655-10659). The LIM 
motif is found in many organisms. For example, a 
sunflower pollen specific protein with a LIM domain has 
been identified (Batlz et al . , 1996, Plant Physiology 
(Supplement III, 59) . Other protein dimerisation 

10 domains exist such as the leucine zipper (Turner, R. and 
Tijian R., 1989 Science, 243, 1689-1694), the helix- 
loop-helix (Murre et al., 1989, Cell, 56, 777-783), the 
ankyrin Blank et al., 1993, Trends in Biochemical 
Sciences, 17, 135-140) and the PAS (Huang et al., 1993, 

15 Nature, 364, 259-262) domains. 

One may also wish to ensure that the genes "encoding 
polypeptides A or B are inserted in the genomes of 
parents a and b at an identical position (or at tightly 

20 linked positions) so that their chance of co- segregation 
in the transgenic hybrid is low. This can be 
advantageous, for example in the production of hybrid 
seed since the two genes that are used to create the 
male-sterile parent plant will subsequently segregate. 

25 Thus, Fl hybrid progeny are 100% male fertile since no 
hybrid plant can inherit both components of the male- 
sterility system. 

The gene sequences carried by the parent plants a and b 
3 0 which encode part of the active enzyme, regulatory 
protein or protein which affects the structural 
integrity of a cell may be from a different organism. 
The gene sequences do not have to be plant derived and 
include genes from microbial or other sources. For 
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example, the- gene sequences may be Arabidopsis 
endogenous sequences in maize or tomato parent plants. 
Also, the gene sequences may be those which, in 
combination with a tissue specific promoter, are 
5 expressed in a tissue in which the gene sequences are 
not normally expressed. 

According to a second aspect of the invention there is 
provided a method for producing a plant having a desired 

10 phenotype of an active enzyme, a regulatory protein or a 
protein which affects the structural integrity of a 
cell, the method comprising crossing a first plant line 
with a second plant line wherein the first line contains 
one or more gene sequences encoding a polypeptide A 

15 which is part of an active enzyme, regulatory protein or 
protein which affects the structural integrity of a cell 
but which line does not have the phenotype and wherein 
the second line contains one or more gene sequences 
encoding a polypeptide B which is complementary to the 

20 polypeptide or protein A but which line does not have 
the desired phenotype. Here, the term "complementary^ 
means that when expressed in the same plant the 
polypeptides A and B associate, interact or come 
together to form the phenotype of an active enzyme, a 

25 regulatory protein or protein which affects . the 
structural integrity of a cell.' 

Such a method may incorporate one or more of the 
features described above for the first aspect of the 
30 invention and the invention contemplates the application 
of these aspects according to the second aspect of the 
invention. 

According to a third aspect of the invention there is 
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provided a seed or plant obtainable from a pair of 
plants according to the first aspect of the invention or 
by a method according to the second aspect of the 
invention. 

5 

According to a fourth aspect of the invention there is 
provided a seed or plant having a phenotype of an active 
enzyme, regulatory protein or protein which affects the 
structural integrity of the cell, which is caused by the 

10 combined action of two or more transgenes, the 
transgenes not being present on the same copy of a 
chromosome. The preferred embodiments of the first, 
second and third aspects of the invention also apply to 
the fourth aspect. This means that the two or more 

15 transgenes are either on different chromosomes, or on 
different copies of the same chromosome, ie. the plant 
is made by a cross. 

The invention will now be described by the following 
20 non-limiting Examples: 

EX AM P LE X 

Splitting the Barnase Gene into Two Components (Figure 
25 11 

The results of Sancho and Ferscht, 1992, J.Mol.Biol., 
224, 741-747 show that Barnase activity can be obtained 
by combining a peptide A containing amino acids 1 to 36 
3 0 of the mature Barnase protein and peptide B containing 
amino acids 37 to 110 of the mature Barnase protein. 
The allele of Barnase which is described in Sancho and 
Ferscht is a mutant which has a methionine at position 
36, allowing cyanogen bromide to cleave between 36 and 
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37 and produce the 2 peptides. The following genetic 
constructs, to express the peptides, were prepared: 

Peptide A: 

5 

i. A Barnase gene with a methionine codon (amino acid 
position -1) added before codon 1 of the mature Barnase 
sequence so that translation can take place as described 
in Paul et al, 1992, Plant Mol.Biol., 19, 611-622. 

10 

ii. An ORF coding for a peptide called A*, containing a 
methionine followed by amino acids 1 to 35 of mature 
Barnase protein followed by an Ochre stop codon. 

15 

iii. A gene made of ORF A* under control of the A9 
promoter by using IPCR on our plasmid p3079, which 
contains the AMS gene pA9-Barnase (as in i. above) - 
Bar star - CaMV 3' region. (See Figure 2). 

20 

Plasmid p3079 was constructed by cloning a fragment 
containing the ORFs for Barnase-Barstar , obtained by PCR 
from pWP127 (Paul et al, 1992, supra) , in our plasmid 
pl415, which is a derivative of pWP91 (WO-A-9211379) 

25 where the EcoRV restriction site has been converted to 

Hindlll. IPCR was then performed on p3079 using primers 
B3 and B4 (see Figures 1 and 2) designed so that the 
sequence between codon 36 of Barnase and stop codon of 
Barstar is not part of the amplified product. The IPCR 

3 0 amplified sequence was then circularised by ligation and 
the resulting plasmid was introduced into E. coli The 
plasmid was then prepared, cut with EcoRI and the 
fragment containing the ORF A* was cloned in the EcoRI 
sites of pl415, so that ORF A* would be under the 
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control of the A9 promoter from a sequence not treated 
by PCR. The resulting plasmid p2022 contains ORF A* in 
the A9 expression cassette . 

5 iv. An ORF coding for a peptide called A**, comprised 
of a start methionine codon followed by amino acids 1 to 
36 of the mutant Barnase (Sancho and Ferscht, 1992, 
supra) but lacking a stop codon. 

10 This was obtained by PCR on template p2022 with primers 
B5 (retaining the Xbal site at the 5' end) and B6 
generating a blunt 3' end. 

v. A gene made of the translational fusion of ORF A** 
15 and the ORF of (Gly4 Serg/GUS under the control of the 

A9 promoter, the product of which shows peptide A fused 
in frame to the N-terminus of (Gly4 Ser)3/GUS (Figure 
ID) . 

20 This was obtained by replacing the S -peptide ORF in 
plasmid p2028 (see example 3) with the ORF of plasmid 
A** (iv) . For ORF replacement an IPCR was performed on 
plasmid p2028 using primers B7 (retaining the Xba site 
at the 5' end) and B8 (generating a blunt 3' end) to 

25 delete the region encoding the S-peptide from the S- 
peptide-GUS translational fusion. After digest with 
Xbal, the PCR fragment encoding peptide A** (iv) was 
inserted Xbal /blunt into the acceptor DNA generated by 
IPCR. 

30 

vi. An ORF coding for peptide A***, essentially 
identical to peptide A**(iv) but lacking a methionine 
start codon and containing an amber stop codon. 
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This was obtained by PCR on template p2022 using primers 
B9 (producing a blunt 5' end) and BIO (introducing a 
BamHI site at the 3' end). The 3' end of the PCR 
product was digested with BamHI for construction of the 
5 ubiquitin-petide A*** translational fusion (below) . 

vii. A gene made of the translational fusion of genomic 
DNA encoding ubiquitin and the ORF A*** under the 
control of the A9 promoter, the product of which shows 
10 peptide A*** fused in frame to the C terminus of 
ubiquitin (Figure IE) . 

The genomic DNA encoding ubiquitin was obtained by PCR 
from chromosomal DNA of AraJbidppsis thaliana. The PCR 

15 primers Ubql6F and UbqlR were designed to amplify the 
ubiquitin encoding sequence from the extension protein 
gene UBQ1, first described by Callis et al. (1990, 
supra) . Restriction sites for Xbal (at 5' end) and 
BamHI (at 3' end) , introduced during thermocycling, were 

2 0 used to clone the PCR fragment under the control of the 
A9 promoter of pl415 digested with Xbal and BamHI to 
yield plasmid p3245. IPCR was then performed on p3245 
using primers UBQla and UBQlb to generate a blunt 
acceptor end immediately after the ubiquitin codon Gly 

25 76 and at the 3' end to reconstitute the BamHI 
restriction site for cloning. ' After BamHI digest this 
construct served as acceptor for the PCR fragment 
encoding peptide A*** (vi) . 

30 Peptide B: 

i. An ORF coding for a peptide called B* which starts 
with a methionine codon followed by codons 37 to 110 of 
the mature Barnase sequence. In effect this transfers 
the methionine 36 of the mutant Barnase gene (Sancho and 
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Ferscht, 1992; supra) from peptide A to peptide B, 
yielding peptides A* and.B*. 

ii. Gene for ORF B* containing the ATG (amino acid 
5 position -1) of Barnase (in p3079) fused to codon 37 of 

Barnase, under control of the A9 promoter, by deleting 
(by IPCR with suitable primers) (see below) ) codons 1 to 
36 of the mature Barnase sequence. 

10 This was done by performing on p3079 an IPCR reaction 
using primers Bl and B2, (Figures 1 and 2) designed so 
that the sequence between codon 2 and codon 36 of 
Barnase is not part of the amplified product (see Figure 
2) . The IPCR product is treated as described above for 

15 ORF A*, and cloned under control of the A9 promoter in 
pl415. The resulting plasmid p2023 contains ORF B* - 
Barstar in the A9 expression cassette. 

iii. An ORF encoding peptide B*** which differs from B* 
20 (i) in that it lacks the start methionine. 

iv. A gene made of the translational fusion of genomic 
DNA encoding ubiquitin and the ORF B*** under the 
control of the A9 promoter, the product of which shows 

25 peptide B*** fused in frame to the C-terminus of 
ubiquitin (Figure IF) . 

IPCR as performed on plasmid p2 023 (above) with primers 
Bll and B12, retaining the Xbal site at the 5' end of B* 
3 0 but removing the ATG start and leaving a blunt 3' end. 
After digest with Xbal, the IPCR product served as an 
acceptor for the ubiquitin encoding DNA. The latter 
sequence was obtained by PCR from plasmid p3245 (above) 
with primers Ubql6F and Ubqlb retaining an Xbal site at 
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the 5' end while leaving the 3' end blunt. After digest 
with Xbal, the IPCR and the PGR product were ligated to 
yield the translational fusion shown. 

5 In Figure 1G: The nucleotide sequences of primers are 
listed which were used for PCR and IPCR, respectively. 

In Fig. 2: Circular plasmid p3079, containing the A9- 
driven barnase/barstar gene (Figure 1) in pl415, served 

10 as template for Inverse PCR. As the PCR primers (Figure 
1) pointed into opposite directions, the IPCR yielded a 
linear double -stranded plasmid DNA from which the region 
in between the 5 1 ends of the annealed PCR primers was 
deleted (below) . Intramolecular ligation would then 

15 yield circular deletion plasmids which were introduced 
into E.coli for further subcloning. 

Also In Fig. 2- : 
lane 1: 

2 0 A schematic (not to scale) representation is shown of 

plasmid p3079. The different structural parts of the 
coding regions are highlighted. ATG and TAA represent 
the start and stop codon of barnase and bars tar, 
respectively. The relative positions of codons 35, 3 6 
25 and 37 of the mature Barnase protein are indicated. 

lane 2 : 

IPCR with primers Bl and B2 deleted codons 1 to 36 of 
the mature Barnase protein. Intramolecular ligation of 

3 0 the linear deletion plasmid then fused the ATG start 

codon to codon 37 yielding the pepB*/barstar region. 

lane 3 : 

IPCR with primers B3 and B4 deleted the sequence 
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downstream of * the barnase codon 35 as indicated. 
Intramolecular ligation of the linear deletion' plasmid 
then fused the barnase codon 35 to the barstar stop 
codon yielding the pepA* sequence. 
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EXAMPLE 2 

£l£a£ — TranstoCTretion — w * th tlie ge ne ti c constructs in 

Example i 

5 

Genes pA9-A* and pA9-B* expressing a polypeptide A and a 
polypeptide B from the A9 promoter (W092/11379) were 
cloned into derivatives of the plant transformation 
vector pBinl? Beven et al., 1984, Nucl . Ac. Res. 12, 

10 8711 Frish et al., 1995 Plant Mol . Biol., 27, 405-409 
and Arabidopsis plants containing pA9 -polypeptide A, or 
pA9 -polypeptide B, or both genes, were obtained. Plants 
containing both genes were male sterile, whereas plants 
containing one gene were unaffected by the transgene. 

15 Plants with one gene were allowed to self, their progeny 
was harvested, and was analysed to identify homozygous 
and heterozygous Tl plants. Tl plants with pA9- 
polypeptide A were crossed with Tl plants with pA9- 
polypeptide B. The hybrid seeds obtained displayed the 

20 predicted phenotype: wild type if containing one gene 
only, and male sterile when containing the two genes. 

Genes are introduced into maize and into tomato by 
biolistic or Agrobacterium-mediated transformation, and 
25 plants are regenerated and assessed for male fertility 
in a similar way. (Mornish et al., 1990 Biol /Technology 

8, 833-839 and Fillati et al ., 1987 Bio/Technology 5, 
726-7390. 

30 E XAMPLE 3 



S plitting an RNAseA gene into two components 

(Figures 3 and 4) 
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From the work of Richards and Vithayathil (1959 supra), 
we know that the enzyme RNAseA can be cleaved (by the 
protease subtilisin) to generate two polypeptides: the 
S-peptide contains amino acids 1 to 20 of RNaseA, and 
5 the S-protein contains amino acids 21 to 124 of RNaseA. 
When combined, the S-peptide and the S-protein 
associate, and reconstitute an active enzyme. The last 
5 amino acids of the S-peptide are not needed for 
reconstituting RNaseA: a smaller S-peptide made of amino 
10 acids 1 to 15 is sufficient. Genes which express the 
S-peptide . and the S-protein under control of the A9 
promoter were used to develop a system according to the 
invention. 

15 The starting material was a synthetic gene coding for 
bovine pancreatic RNAseA (Vasantha and Filpula, 1989, 
Gene 76 53-60) . A gene coding for the ORF of RNaseA was 
made using synthetic oligonucleotides (see Figures 3A 
and 3B) . The nucleotide sequence of the gene was 

20 designed to be compatible with maize codon usage, 
according to Fennoy and Bailey-Serres, 1993 Nuc. Acids 
Res., 21, 5294-5300. PCR with suitable primers was 
used to amplify from the full length ORF. The following 
ORES were built: 

25 

S-peptide: 

i. An ORF for the S-peptide containing a methionine 
translation initiation codon followed by codons.l to 15 
of the mature RNaseA sequence (see Figures 4A and 4B) 

3 0 and terminated by an Ochre stop codon. 

ii. An ORF made of a methionine translation initiation 
codon followed by codons 1 to 15 of the mature RNAseA 
sequence, followed by a linker sequence encoding (Gly4- 
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Ser)3 (see Figures 4A and 4B) . This gene was designed 
so that it can be fused in frame to the ORF of the GUS 
protein by cloning in the BamHI site of plasmid p2027 
which ■ contains the GUS gene from pBI101.3 (Jeifferson, 
5 1987 Plant Mol .Biol .Reporter, 5 387-405). 

iii. A translational fusion comprising the ORF of the 
mitochondrial protein targeting sequence of AOX3 protein 
from soybean (Finnegan and Day, 1997, Plant Physiol. 

10 114, pp455) and the ORF of S-peptide as described in (i) 
but lacking- the methionine translation initiation codon 
(Figure 4C) . The gene product of said translational 
fusion shows the S-peptide fused to the C-terminal end 
of the targeting sequence. 

15 

iv. A translational fusion comprising the ORF of the 
mitochondrial protein targeting sequence of AOX3 protein 
(supra) and the ORF of the S-peptide-GUS fusion as 
described in (ii) but lacking the methionine 

2 0 translational initiation codon (Figure 4C) . The gene 
product of said translational fusion shows that the S- 
peptide-GUS protein fused to the C-terminal end of the 
targeting sequence. 

25 S-protein: 

i. An ORF for the "S-protein +5", which contains a 
methionine translation initiation codon followed by 
codons 16 to 124 of mature RNAseA sequence and is 
terminated by an Ochre codon. 

30 

ii. An ORF for the S-protein which contains a 
methionine translation initiation codon followed by 
codons 21 to 124 of mature RNAseA sequence and is 
terminated by an Ochre codon. 
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iii. A translational fusion comprising the ORF of the 
mitochondrial protein targeting sequence of A0X3 protein 
(supra J and the ORF of the S-protein as described in 
5 (ii) but lacking the mathionine translational 
intitiation codon (Figure 4C) . The gene product of said 
translational fusion shows the S-protein fused to the C- 
terminal end of ubiquitin. 

10 iv. A translational fusion comprising genomic DNA 
encoding ubiquitin and the ORF of the S-protein as 
described in (ii) but lacking the methionine 
translational intitiation codon (Figure 4C) . The gene 
product of said translational fusion shows the S-protein 

15 fused to the C- terminus of ubiquitin. 

Genes under control of the A9 promoter were then built 
and introduced into plants as in Example 2 . 

2 0 In Fig. 3A: The sequences encoding the S -peptide, the 

S (+5) -protein and the S-protein were constructed by- 
first aligning sense oligonucleotides RN-I to RN-VII 
lanes 2, 5, 7, 9, 11, 13, 16) along complementary guide 
oligonucleotides RN-1 to RN-6 (lanes 3, 6, 8, 10, 12, 
25 14) and then selectively ligating the correctly aligned 
sense oligonucleotides using Taq-DNA-Ligase. 

The ligation resulted in a continuous single DNA strand 
sense) which was subsequently amplified by Vent DNA 

3 0 polymerase (25 PCR cycles) using one of two primer pairs 

as follows: (i) Primers RN-a (lane 1) and RN-b (lane 15) 
amplified the full ligation product. The PCR product 
was gel purified and cleaved with restriction enzymes 
BamHI (underlined, lanes 1 and 15) and Bglll underlined, 
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lanes 2 and 4} to yield two DNA fragments encoding the 
S-peptide and the S(+5) protein. The two fragments were 
cloned separately into the BamHI site downstream of the 
pA9 promoter in plasmid pl415 to yield plasmids p4837 
5 (S-peptide) and p4838 (S+5 protein) . (ii) Primers RN-d 
(lane 4) and RN-b (lane 15) amplified the coding 
sequence of the S -protein. The PCR product was cloned 
as described in (i) to yield plasmid p4839 (S-protein) . 



10 lane 1: 
lane 2 : 



lane 3 
lane 4 
lane 5 



15 



20 



25 



30 



lane 6 : 
lane 7 : 

lane 8 : 
lane 9: 

lane 10: 
lane 11; 

lane 12 : 
lane 13 : 

lane 14 
lane 15 
lane 16 



PCR primer (sense) RN-a 
Oligonucleotide EN- I and alignment to 
oligonucleotide RN-II 

Guide oligonucleotide RN-1 (antisense) 
PCR primer (sense) RN-d 

Oligonucleotide RN-II (continued from lane 2) 
and alignment to oligonucleotide RN-II IN 
Guide oligonucleotide RN-2N (antisense) 
oligonucleotide RN-IIIN (continued from lane 
5) and alignment to oligonucleotide RN-IV 
Guide oligonucleotide RN-3 (antisense) 
oligonucleotide RN-IV (continued from lane 7) 
and alignment to oligonucleotide RN-V 
Guide oligonucleotide Rn-4 (antisense) 
oligonucleotide RN-V (continued from lane 9) 
and alignment to oligonucleotide RN-VI 
Guide oligonucleotide RN-5 (antisense) 
oligonucleotide RN-VI (continued from lane 11 
and alignment to oligonucleotide RN-VII 
Guide oligonucleotide Rn-6 (antisense) 
PCR primer (antisense) RN-b 

oligonucleotide RN-VII (continued from lane 
13) 
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Symbols: 

(5 1 ): non-phosphorylated 5' end 
(5P) : phosphorylated 5' end 
( 3 OH) :* conventional 3' end 
5 (small letters) : bases added for the convenience of 
cloning . 

In Fig. 3B: The sequences encoding the S-peptide with 
the (Gly 4 Ser) 3 -linker peptide were constructed by first 
10 aligning sense oligonucleotides RN-I and RN-VIII (lanes 
2 and 4) along the complementory guide oligonucleotide 
RN-7, and then selectively ligating the correctly 
aligned oligonucleotides using Taq-DNA-Ligase . 

15 The ligation resulted in a continuous single DNA strand 
which was subsequently amplified by Vent DNA polymerase 
(25 PCR cycles) using the primer pair RN-a (lane 1) and 
RN-c (lane 5) . This PCR reaction yielded the full 
length, double stranded ligation product. The PCR 

20 product was gel purified, then cleaved with restriction 
enzymes BamHI (underlined, lane 1) and Bglll 
(underlined, lane 5) and cloned into the BamHI site of 
p2027 to generate an NH 2 -terminal protein fusion to GU3 
under the control of the pA9 promoter (p2027 was 

25 constructed by cloning the GUS coding sequence of 
pBI101.3 as a BamHl/Smal fragment into the BamHI site of 
pl415) . This yielded plasmid p2028. 

lane 1: PCR primer (sense) RN-a 
30 lane 2: Oligonucleotide RN-I encoding the S-peptide as 
in Figure 3a and the alignment to 
oligonucleotide RN-VIII encoding the (Gly4- 
Ser)3 linker peptide 
lane 3: Guide oligonucleotide (antisense) RN-7 
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lane 4: Oligonucleotide RN-VIII (continued from lane 
2) 

lane 5: PCR primer (antisense) RN-c 

5 Symbols: 

(5'): non-phosphorylated 5" end 
(5P) : phosphorylated 5' end 
(30H) : conventional 3 1 end 

(small letters) : bases added for the convenience of 
10 cloning 

In Figure 4A: The protein and DNA sequence is shown for 
S-peptide and the S-peptide with (Gly4 Ser) 3 linker. 
The S-peptide linker sequence was fused in frame to GUS 
15 to yield plasmid p2028 as described for Figure 3B. 

In Figure 4B: The ORF for (S+5) -protein and S-protein is 
shown as contained in plasmids p4 838 and p483 9, 
respectively. These plasmids were described above for 
2 0 Figue 3A. 



In Figure 4C: 

(i) The mitochondrial protein targeting sequence (short 
of the last four amino acids: Leu-Arg-Arg-Met) was 

25 obtained by PCR with primers AOX3MI1 and AOX3MI2 from a 
plasmid which contained the cDNA of Alternative Oxidase 
(AOX3) of soybean as published by Finnegan and Day, 1997 
(Plant Physiol. 114, pp455) . Restriction sites (Xbal 
and Bglll at the 5' end and Aflll and BamHI at the 3' 

30 end) were introduced during the thermocycling to yield 
the PCR product which was cloned Xbal /BamHI downstream 
of the A9 promoter in pl415. This plasmid was called 
p0200. 
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(ii) Primers SPEPMI1 and SPEPMI2 were then used to 
produce from plasmid p4837 a PCR fragment .encoding 
within and downnstream of an Aflll restriction site the 
missing four amino acids (Leu-Arg-Arg-Met) of the 

5 mitochondrial targeting signal followed by the ORF of S- 
peptide. A PCR generated BamHI site at the 3' end 
allowed cloning of the PCR fragment as an Aflll/BamHI 
fragment into p0200. This cloning yielded plasmid 
p0203, containing the complete ORF of the translational 
10 fusion as shown. 

(iii) The translational fusion of mitochondial targeting 
sequence and ORF of S -peptide -GUS was generated in a 
similar fashion as described in (ii) except that PCR 

15 primers SPEPMI1 and SPEPMI2 were used on template p2028 
to generate an Aflll/BamHI fragment that was cloned into 
p0200 to yield p0204. 

(iv) The translational fusion of mitochondrial targeting 
20 sequence and ORF of S-protein was generated in a similar 

fashion as desccribed in (ii and iii) , except that PCR 
primers SPROTMI1 and SPR0TMI2 were used ontemplate p4838 
to generate an Aflll/BamHI fragment that was cloned into 
p0200 to yield p0202. 

25 

(v) A PCR fragment was generated from template p4839 
with primers SPROTF and SPROTR containing the ORF of S- 
protein in between BamHI restriction sites at either 
end. After digestion with BamHI this PCR fragment was 

3 0 cloned into the BamHI site of p3245 which yielded the 
translational fusion in p3249 of genomic ubiquitin DNA 
and S-protein as shown. 
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EXAMPL E 4 

Use of the Dimer Protein Apetala3-Pistillata 
Apetala3 (Ape3) and Pistillata (Pi) are two proteins of 
5 Arabidopsis thaliana which are involved in the 
regulat ion of floral differentiation. The genes are 
known while the endogenous pattern of expression in the 
tapetum is not known. Expression of the Arabidopsis 
genes in the maize tapetum leads to disruption of the 
10 normal anther development by activating normally silent 
genes. These genes can also be used to activate, in the 
maize tapetum, an Arabidopsis promoter responsive to the 
Ap3-Pi dimer such as the Ap3 promoter (pAp3) itself. 

15 We have built the following genes: 

pA9-Apetala3 

The cDNA for Ap3 (Jack et al, 1992, Cell 68, 683-697 
GenBank Accession No. M86357) was cloned in the A9 

20 expression cassette of pWP91 (WO-A-9211379) giving 
plasmid p4796. This plasmid contains the Ap3 cDNA with 
approximately 15 bases of 5 ' untranslated sequence 
followed by the whole ORF (698 bases from ATG to TAA) 
followed by approximately 120 bases of 3' untranslated 

25 sequence, cloned in the BamHI site of pWP91. 

pA9-Pistillata 

The cDNA for Pi (Goto and Meyerowitz, 1994, Genes Dev. 
8, 154 8-1560 GenBank Accession No. D30807) was cloned in 
3 0 the A9 expression cassette of pWP91 (WO-A-9211379) 
giving plasmid p0180. This plasmid contains the Pi cDNA 
with approximately 24 bases of 5 1 untranslated sequence 
followed by the whole ORF (626 bases from ATG to TGA) 
followed by approximately 250 bases of 3 1 untranslated 
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sequence, cloned in the Xbal-BamHI sites of pWP91. 
pApetala3 -PRGlucanase 

The A9* promoter sequence in plasmid A9PR (described in 
5 Worrall et al, 1992, The Plant Cell, 4, 759-771) was 
replaced by a 1250 bp (approx) sequence containing the 
Ap3 promoter region, obtained by PCR amplification of 
Arabidopsis thaliana genomic DNA, according to the 
published sequence (Jack et al, 1994 Cell, 76, 703-716), 
10 giving plasmid p4817. 

The genes were introduced in maize in various 
combinations, by biolistic transformation techniques 
known in the art. Plants were regenerated and assessed 
15 for male fertility. 

-p4796 (pA9-Ap3) /p0180 (pA9-Pi) cause male sterility. 
Neither of them alone causes male sterility. 
-p4796/p0180/p4817 (pAp3 -PRGlucanase) cause sterility, 
20 when p4817 with only one of the two transcription factor 
genes does not . 
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1. A pair of parent plants for producing seeds 
comprising: 

5 

(i) a first parent plant containing one or more 
gene sequences encoding a polypeptide or protein A: 
and 

10 (ii) a second parent plant containing one or more 

gene sequences encoding a polypeptide or protein B; 

wherein the polypeptides A, B, when expressed in 
separate plants, do not form an active enzyme, a 

15 regulatory protein or protein which affects the 
functionality and/or viability and/or the structural 
integrity of a cell, but when expressed in the same 
plant do form an active enzyme, regulatory protein, or 
protein which affects the structural integrity of a 

20 cell. 

2 . A pair of plants as claimed in claim 1 wherein the 
one or more gene sequences from at least one of the 
parents is transgenic. 

25 

3. A pair of plants as claimed in claim 1 or claim 2 
wherein the polypeptides or proteins A, B, when 
expressed in the same plant, cause cell ablation, 
especially male-sterility or embryoless seeds. 

30 

4. A pair of plants as claimed in any one of claims 1 
to 3 wherein one of the parent plants is male-sterile. 

5. A pair of plants as claimed in any one of claims 2 
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to 4 wherein . the one or more gene sequences encoding 
both or one of the polypeptides or proteins A, B, is 
operatively linked to a tissue specific promoter; 

5 6. A pair of plants as claimed in any one of claims 1 
to 5 wherein the polypeptides A, B are naturally 
occurring subunits of the protein complex of an active 
enzyme, regulatory protein, or protein which affects the 
structural integrity of a cell. 

10 

7. A pair of plants as claimed in claim 6 wherein the 
polypeptides A, B are two polypeptide subunits of an 
enzyme having RNase activity such as the enzyme Barnase 
or RNase A or the monomers of the protein complex of the 

15 Apelata3-pistillata. 

8 . A pair of plants as claimed in any one of claims 1 
to 5 wherein the polypeptides A, B are artificially 
split polypeptides of an active enzyme, regulatory 

2 0 protein or protein which affects the structural 
integrity of a cell. 

9. A pair of plants as claimed in any one of the 
preceding claims wherein each parent plant is homozygous 

2 5 with respect to the one or more gene sequences encoding 

polypeptide A or B respectively. 

10 . A pair of plants as claimed in any one of claims 3 
to 9 wherein the cause of male-sterility is direct or 

3 0 indirect. 

11. A pair of plants as claimed in any one of claims 5 
to 10 wherein the tissue-specific promoter is a tapetum- 
specific promoter, an embryo- specif ic promoter or a seed 
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specific promoter. 

12 . A pair of plants as claimed in any one of claims 1 
to 11- wherein one or both of the polypeptides or 
5 proteins is fused to a carrier protein and/or a protein 
targeting signal. 

. 13 . A pair of plants as claimed in any one of claims 1 
to 12 wherein each polypeptide or protein A, B is linked 
10 to a protein dimerisation domain of a dimeric or 
multimeric protein sequence that promotes association of 
between subunits A and B. 

14 . A pair of plants as claimed in any one of the 
15 preceding claims wherein the one or more gene sequences 

from at least one of the parent plants is a heterologous 
gene sequence. 

15. A method for producing a plant having a desired 

2 0 phenotype by virtue of an active enzyme, a regulatory 

protein or a protein which affects the structural 
integrity of a cell, the method comprising crossing a 
first line with a second line wherein the first line 
contains one or more gene sequences encoding a 
25 polypeptide or protein but which line does not have the 
desired phenotype and wherein ' the second line contains 
one or more gene sequences encoding a polypeptide or 
protein B which is complementary to the polypeptide or 
protein A but which line does not have the desired 

3 0 phenotype . 

16. A method as claimed in claim 15 wherein the one or 
more gene sequences from at least one of the lines is 
transgenic . 
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17. A method as claimed in claim 15 or claim 16 wherein 
desired phenotype is cell ablation especially male- 
sterility or embryoless seeds. 

5 

18. A method as claimed in any one of claims 15 to 17 
wherein one of the lines is male - sterile . 

19. A method as claimed in any one of claims 15 to 18 
10 wherein the one or more gene sequences encoding 

polypeptides or protein A and/or B is operatively linked 
to a tissue-specific promoter. 

20. A method as claimed in any one of claims 15 to 19 
15 wherein the polypeptides or proteins A, B are naturally 

occurring subunits of an active enzyme, regulatory 
protein or protein which affects the structural 
integrity of a cell. 

21. A method as claimed in claim 20 wherein the 
polypeptides or proteins A, B are two polypeptide 
subunits of an enzyme having RNase activity such as the 
enzyme Barnase, RNase A or the subunits of the protein 
Apelata3 -pistillata . 

22. A method as claimed in aily one of claims 15 to 19 
wherein the polypeptides or proteins A, B are 
artificially split polypeptides of an active enzyme, 
regulatory protein or protein which affects the 
structural integrity of a cell. 

23. A method as claimed in any one of claims 15 to 22 
wherein each line is homozygous with respect to the gene 
sequence encoding polypeptide or protein A, B, 



25 
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respectively. . 

24 . A method as claimed in any one of claims 15 to 23 
wherein the desired phenotypic trait is direct or 

5 indirect male-sterility. 

25. A method as claimed in any one of the claims 15 to 
24 wherein the tissue-specific promoter is a tapetum- 
specific promoter, an embryo-specific promoter or a seed 

10 specific promoter. 

26. A method as claimed in any one of claims 15 to 25 
wherein one or both of the polypeptides or proteins A, B 
is fused to a carrier protein and/or a protein targeting 

15 signal. 

27. A method as claimed in any one of claims 15 to 26 
wherein each polypeptide or protein A, B is linked to a 
different protein dimerisation domain of a dimeric or 

2 0 multimeric protein. 

28. A method as claimed in any one of claims 15 to 27 
wherein at least one of the lines contains, as the one 
or more gene sequences, heterologous gene sequences. 

25 

29. A seed or plant obtainable from a pair of plants as 
claimed in any one of claims 1 to 14 or by a method as 
claimed in any one of claims 15 to 28. 

30 30. A seed or plant, having a phenotype of an active 
enzyme, regulatory protein or protein which affects the 
integrity of a cell, which is caused by the combined 
action of two or more transgenes, not present on the 
same copy of a chromosome. 
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i. PCR amplification product encoding impartial AOX3 targeting signal 

Xbal / Bglll 

tctagatcttaac ATGAAGAATG TTTTAGTAAG GTCAGCTGCG CGAGCTCTGC TTGGCGGCGG 
TGGGCGGAGC TACTACCGCC AGCTCTCAAC GGCGGCGATC GTGGAACAGA 
GACACCAGCA CGGTGGCGGC GCGTTTGGAA GCTTCCA ct taagcggatcc 

Aflll / BamHI 



ii. ORF encoding AOX3 targeting sequence (underlined) and S-peptide 

ATGAAGAATG TTTTAGTAAG SIC&figBSCS CGAGCTCTGC TTGGCGGCGG TGGGCGGAGC 
TACTACCGCC AGCTCTCAAC ggcggcgatc GTGGAACAGA GACACCAGCA GCGTGGCCCC 

GCSHZSGAA GCTTCCACTT AAGAAGGATG AAGGAGACCG CCGCCGCCAA GTTCGAGCGC 
CAGCACATGG ACAGCTAA 



Hi. ORF encoding AOX3 targeting sequence (underlined) 
and S-peptide-(G!y4 Ser)3-GUS 

ATGAAGAATG TTTTAGTAAG GTCAGCTGCG CGAGCTCTGC TTGGCGGCGG TGGGCGGAGC 

X&gcaCCSCC &SCICECft&C ggcggcgatc GISSflAC&Sfl GACACCAGCA CGGTGGCGGC 
GCGTTTGGAA GCTTCCACTT AAGAAGGATG AAGGAGACCG CCGCCGCCAA GTTCGAGCGC 
CAGCACATGG ACAGCGGCGG TGGCGGTTCC GGTGGCGGTG GCAGCGGCGG CGGTGGTAGC 
GGGATCCCCG GGTACGGTCA GTCCCTTATG — > GUS 



iv. ORF encoding AOX3 targeting sequence (underlined) 
and S-protein 

ATGAAGAATG TTTTAGTAAG GTCAGCTGCG CGAGCTCTGC TTGGCGGCGG TGGGCGGAGC 

tactaccgcc AGCTCTCAAC ggcggcgatc GTGGAACAGA GACACCAGCA CGGTGGCGGC 
GCGTTTGGAA GCTTCCACTT AAGAAGGATG AGCTCCTCCA ACTACTGCAA CCAGATGATG 
AAGTCTAGGA ACCTGACCAA GGACAGGTGC AAGCCAGTCA ACACCTCCGT CCACGAGAGC 
CTGGCCGATG TCCAGGCCGT CTGCAGCCAG AAGAACGTGG CCTGCAAGAA CGGTCAGACC 
AACTGCTACC AGTCCTACAG CACCATGTCC ATCACCGACT GCCGCGAGAC CGGCTCCAGC 
AAGTACCCTA ACTGCGCCTA CAAGACCACA CAGGCCAACA AGCACATCAT TGTTGCCTGC 
GAGGGTAACC CTTACGTGCC TGTCCACTTC GACGCCTCCG TCTAA 



v. Transiational fusion of Ubiquitin genomic sequence and ORF of S-protein 

ATGCAGATCT TCGTGAAAAC CTTGACCGGC AAGACCATCA CTCTCGAGGT CGAGAGCAGC 
GACACCATCG ACAATGTCAA GGCCAAGATC CAAGACAAAG AA GGTATCAT TCTTCCTCAC 

TCAATCTGGA TTCTTCTCTT tagctttttg aaattcagat ctcttatcat TTACTTGTTX 

CTCCTTTAAG GAATCCCTCC GGATCAGCAG AGATTGATCT TCGCCGGAAA GCAGCTCGAA 
GATGGCCGTA CTTTGGCTGA CTACAACATC CAGAAAGGTA CGAAATCATC CGAATCCTTC 

TGTTGATCAT TTCGATGATC tgattgtata aactctaatg gattgttatc ATTTGTAAAC 
AGAATCTACA CTTCATCTTG TGTTGAGGCT TAGAGGtGOa tcCagCTCCA ACTACTGCAA 
CCAGATGATG AAGTCTAGGA ACCTGACCAA GGACAGGTGC AAGCCAGTCA ACACCTCCGT 
CCACGAGAGC CTGGCCGATG TCCAGGCCGT CTGCAGCCAG AAGAACGTGG CCTGCAAGAA 
CGGTCAGACC AACTGCTACC AGTCCTACAG CACCATGTCC ATCACCGACT GCCGCGAGAC 
CGGCTCCAGC AAGTACCCTA ACTGCGCCTA CAAGACCACA CAGGCCAACA AGCACATCAT 
TGTTGCCTGC GAGGGTAACC CTTACGTGCC TGTCCACTTC GACGCCTCCG TCTAA 

Underlined: introns A and B within the ubiquitin encoding sequence 
Bold: codon for Glycine 7 ^ marking the C-terminus of the ubiquitin. 

Small letters: PCR introduced conservative codon changes to generate a BamHI site 
and to modify the codon usage 
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FIG. UD 

Nucleotide sequence of T PCR primers (example 3) 



Sprot F 
Sprot R 

SprotMIl 
SprotMO 

SpepMIl 
SpepMLZ 
SpepGMI2 

AOX3MI1 
AOX3MI2 



GGTGGATCCAGCTCCAACTACTGCAAC 3' 
CGGGATCCTTAGACGGAGGCGTCG 3' 

GTCCTTAAGAAGGATGAGCTCCTCCAACTAC 3 ' 
CGGGATCCTTAGACGGAGGCGTCG 3 1 

GTCCTTAAGAAGGATGAAGGAGACCGCCG 3' 
TCGGGATCCTTAGCTGTCCATGTGCTG 3' 



5' TGCTCTAGATCTTAACATGAAGAATGTTTTAG 3' 
5' TCGGATCCGCTTAAGTGGAAGCTTCCAAAC 3' 



SUBSTITUTE SHEET (RULE 26) 



WO 98/37211 



PCT/GB98/00542 



12/12 

FIGURE SHOWING A PRODUCTION SCHEME OF EMBRYO LESS MAIZE GRAINS: 
LINES A AND B ARE SOWN IN ALTERNATIVE ROWS (FOR EXAMPLE ONE MALE 

AND FOUR FEMALES) 




LEGEND 
(REFER TO 
DESCRIPTION 
FOR DETAILS) 



MALE PARENT A 



FEMALE PARENT B 



FIG. 5 
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